A data partitioning scheme for spatial regression

نویسندگان

  • Slobodan Vucetic
  • Tim Fiez
  • Zoran Obradovic
چکیده

Precision agriculture data consisting of crop yield and topographic features are examined with the objective of explaining yield variability as a function of topographic attributes in order to extrapolate this knowledge to unseen agricultural sites. It is demonstrated that random data partitioning into training, validation and test subsets is not appropriate when dealing with agricultural problems characterized with strong spatial data correlation. A simple spatial data partitioning scheme that leads to significantly faster neural network training and slightly better generalization is proposed. Also, integration of predictors formed from spatially partitioned data led to improved generalization over a bagging integration procedure in experiments. The margin between the best spatial model and a trivial predictor for our precision agriculture problem was small indicating that topographic features alone could explain only a small amount of the yield variability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatial Varying Coefficient Regression Model For Relative Risk Factors of Esophageal Cancer Patients

In conventional methods for spatial survival data modeling, it is often assumed that the coefficients of explanatory variables in different regions have a constant effect on survival time. Usually, the spatial correlation of data through a random effect is also included in the model. But in many practical issues, the factors affecting survival time do not have the same effects in different regi...

متن کامل

Spatial Correlation Testing for Errors in Panel Data Regression Model

To investigate the spatial error correlation in panel regression models, various statistical hypothesizes and testings have been proposed. This paper, within introduction to spatial panel data regression model, existence of spatial error correlation and random effects is investigated by a joint Lagrange Multiplier test, which simultaneously tests their existence. For this purpose, joint Lagrang...

متن کامل

Patchwork Kriging for Large-scale Gaussian Process Regression

This paper presents a new approach for Gaussian process (GP) regression for large datasets. The approach involves partitioning the regression input domain into multiple local regions with a different local GP model fitted in each region. Unlike existing local partitioned GP approaches, we introduce a technique for patching together the local GP models nearly seamlessly to ensure that the local ...

متن کامل

Assessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories

In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...

متن کامل

The R⊕-tree: Incorporating Object Partitioning into the R-tree

During the past three decades, researchers devoted much effort to developing efficient techniques to index spatial data. Of those proposed, the R-tree[1] is perhaps the most important. R-trees are ubiquitous in commercial database management systems, and they find myriad applications in other disciplines as well. In this paper, we seek to improve the state-of-the-art R-tree[2] by introducing sp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999